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We consider the problem of distinguishing between a set of arbitrary quantum states in a setting 
in which the time available to perform the measurement is limited. We provide simple upper bounds 
on how well we can perform state discrimination in a given time as a function of either the average 
energy or the range of energies available during the measurement. We exhibit a specific strategy 
that nearly attains this bound. Finally, we consider several applications of our result. First, we 
obtain a time-dependent Tsirelson's bound that limits the extent of the Bell inequality violation 
that can be in principle be demonstrated in a given time t. Second, we obtain a Margolus-Levitin 
type bound when considering the special case of distinguishing orthogonal pure states. 



I. INTRODUCTION 

Entropic measures tell us how much information a 
quantum register E contains about some classical register 
X in principle. But just how quickly does this informa- 
tion become available to us? In this little note, we derive 
bounds on the amount of information available after a 
given time t. As expected, our bounds depend on the 
resources we have available in the form of the available 
energy. 

Throughout this paper, we will choose to measure in- 
formation in terms of the min- entropy, which is the rele- 
vant quantity when we consider single-shot experiments 
and quantum cryptography. As we will explain in detail 
below, this measure is directly related [T] to the proba- 
bility of success in state discrimination [2H6]. As a result, 
we focus on bounding the probability of success in dis- 
tinguishing states {px}xex where we are given with 
probability px- Let Pgucss{X \E)H,t denote this success 
probability after time t when using a particular Hamil- 
tonian H in the measurement process. After providing a 
more careful discussion of the measurement process, we 
show the following results. 



A. Results 

A bound for two states: We first consider the case of 
only two input states po,pi, for which it is easy to com- 
pute the optimal success probability if we have unlimited 
time (or resources) available [2^ . We first provide a gen- 
eral bound in terms of the spectrum of the Hamiltonian 
(Corollary B.2|. For the special case of of two equiprob- 
able states (po = pi = 1/2), this bound simply reads 
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(1) 



where D(pQ,pi) is the trace distance between the two 
states, and 7 is a small constant. This bound is directly 
related to our ability to distinguish two inputs states 



given an unlimited amount of time, where the best mea- 
surements gives us P] 



s{X\E) = - + 



1 , D{pQ,pi) 



(2) 



We proceed to show that our bound is nearly tight up 
to a constant factor (Theorem D.l) by providing an ex- 
plicit measurement strategy. Finally, we prove a bound 
in terms of the average energies of the input states (The- 
However, this bound does not compare as 



B.3I 



easily to the case of unlimited time. 

A bound for many input states: When considering the 
case of an arbtirary number of input states po, . . . , pn-i 
it is difficult to compute the maximum success probabil- 
ity even in the case of unlimited time. In particular, no 
general closed form expression is known - only for the 
case of single qubit encodings does there exist a way to 
construct the optimal measurements geometrically [7] . In 
general, we can only approximate the optimal measure- 
ments numerically [SJ HHH] , or resort to bounds on the 
success probability [41 IT5H20] . As such, it becomes harder 
to relate Pgncss{X\E) u ,t to case of unlimited time. We 
hence provide a general bound in terms of the average 
energies alone. In particular, we show (Theorem C.l) 
that 



P. 



.{X\E)<p., 
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where Xmax is the smallest x G {0, . . . , — 1} such that 
Px„ax ^ Px for all X. 

Applications: Finally, we discuss two applications of 
our bound. The first is to the study of Bell inequali- 
ties |21j . Typically, we care about determining the max- 
imum quantum violation of such inequalities. In con- 
trast, we ask what is the maximum violation that can 
be achieved in a fixed amount of time. When consider- 
ing such inequalities as games between two players Alice 
and Bob (see Section IVB 1 ), the "amount" of quantum 



violation is determined by the probability Pwin that the 
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players win the game maximized over all states and mea- 
surements. For the CHSH inequality [22j, we have that 
classically 



< 



(4) 



for any strategy of Alice and Bob. However in quantum 
mechanics there exists a strategy that achieves 
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which is optimal [531. Here, we show (Corollary F.2) that 



if we demand answers from Alice and Bob after time t 
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where H is Bob's Hamiltonian involved in the measure- 
ment process, and 7 is a small constant. We will also see 
that to achieve Tsirelson's bound, Alice and Bob need 
time at least 



t > 



(7) 



Our bounds tell us that there does indeed a fundamental 
time that is needed to establish non-local correlations of 
a certain strength. We will discuss these bounds in detail 
in Section HVBl 

As a second application, we use our bound to obtain 
a form of the Margolus-Levitin theorem [21] which pro- 
vides us with a lower bound on how much time it takes 
to transform a pure state into an orthogonal state. Since 
the Margolus-Levitin theorem provides a bound on the 
speed of evolution, it clearly provides a bound on the 
minimum amount of time that is required to obtain the 
optimal (time unlimited) success probability for state dis- 
crimination. Yet, note that we are interested in bounding 
Pgucss{X \E)H,t even for shorter periods of time. We will 
discuss the relation of our work and the Margolus-Levitin 
theorem in detail in Section HV Al 



B. Related work 

Next to the Margolus-Levitin theorem [21] , our work is 
related to several bounds [551 US] on how fast information 
can be transmitted in principle given energy constraints 
(see [271 for a survey of results) . These bounds generally 
consider the von Neumann entropy as a measure of infor- 
mation and are concerned with determining the capacity 
for sending information as a function of energy. That is, 
they consider how fast we could convey information in 
the best possible way. In contrast, we consider the case 
of arbitrary encodings px, which may not be optimal to 
transmit classical information. In fact, even in the case 
of unlimited time the probability that we can reconstruct 
X from px could be very small. Our setting also differs 



in the sense that we focus solely on extracting classical 
information into a classical register in a sense that we 
will make precise below. 

Our work is also related to several previous papers [28J- 
150] that study the rate of change in entropies of a system 
that is in contact with an environment. Again, our work 
is a somewhat different flavor since we are interested in 
extracting classical information, and our bounds further- 
more involve average energies, rather than the largest 
energy ||i?||oo of the (interaction) Hamiltonian H alone. 



II. GAINING CLASSICAL INFORMATION 
A. Quantifying information 

Let us now consider more formally what we mean by 
gaining classical information encoded in a quantum sys- 
tem. Imagine that there is some finite set X of possible 
classical symbols to be encoded. For any symbol x & X, 
we thereby use px & l3{Hcnc) to denote its encoding into 
a quantum state on the system "Hcnc- We also refer to 
Hcnc as the encoding space. Our a priori ignorance about 
the classical information x is captured by the probability 
distribution px according to which the encoding space is 
prepared in the state px- 

Throughout, we quantify how much information we 
have about x given access to the encoding space Henc in 
terms of the min-entropy [1] 



Roo{X\E) -logPgues«(^|-B) 



(8) 



where 



sup 



Y^P^^^iMxPx) , (9) 



is the probability that we guess x, maximized over all pos- 
sible measurements on the encoding space. Finding the 
optimal measurement is known as state discrimination 
and can be done using semidefinite programming [51 [5] . 
The min-entropy accurately measures information in a 
cryptographic setting [M], and for single shot experi- 
ments. This is in contrast to the von Neumann entropy 
which is concerned with the asymptotic case of a large 
number of identical experiments. 

The min-entropy and the von Neumann entropy can 
be arbitrarily different, as is easily seen by consider- 
ing the example where the encoding is trivial, that is, 
Px = Px' for all X and x' . The strategy that maximizes 
the guessing probability Pgucss{X\E) is then simply given 
by outputting the most likely symbol, i.e., lloo{X\E) = 
— log maxx Px |38| , and the conditional von Neumann en- 
tropy obeys ii{X\E) = H(X) = ~J2xP=^^°SPx- Con- 
sider now S = {0, 1}"' to be the set of bitstrings of length 
n and suppose the all '0' string occurs with probability 
Po" — 1/2, and with probability 1/2 any of the remaining 
strings occurs with equal probability. Clearly, we have 
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Hoo(X|£;) = 1, whereas H(X) pa n/2. That is, the von 
Neumann entropy can be very large, even if there is one 
symbol that occurs with extremely high probability. We 
will remark on the rate of information extraction from a 
quantum system in terms of the von Neumann entropy 
later on, but focus on the single shot case given by the 
min-entropy, or equivalently the probability of error in 
state discrimination. 



See Figure [T] for a schematic depiction of this process. 
Note that the ancilla is measured by the referee at no 
time cost. This is a natural assumption in our setting 
where we imagine that the final information is extracted 
by a referee who is not limited by any energy constraints. 
Such a referee naturally arises in, for example, the setting 
of Bell inequalities which we consider later. We will from 
now on assume that measurements producing classical 
outcomes are always performed this way. 



B. Producing a classical output 

To determine how quickly we can acquire classical in- 
formation, we first need to specify what it means to out- 
put classical information from a measurement. Here, we 
model this process with the help of an additional 'clas- 
sical' ancilla system 'Hanc that contains the output. A 
classical system is associated with a fixed basis, which 
without loss of generality we take to be the computa- 
tional basis. Preparation and measurement of a classical 
system can only be done in this basis, which intuitively 
corresponds to the idea of storing classical information: 
The ancilla can be prepared in any state of the fixed ba- 
sis, and is subsequently measured in this basis after time 
t. The information contained in this register captures 
the notion of a classical probability distribution over the 
basis elements. 

We model the process of state discrimination as fol- 
lows. The problem is to discriminate between N states 
Pj. on the encoding space Jienci where A'^ is the number 
of possible classical symbols. At the beginning of the ex- 
periment the ancilla system is initialized to the symbol 
occurring with the largest probabily |x,„ax) where 



argmax^ Px 



(10) 



This initial condition captures the distinguisher's apriori 
knowledge: recall without access to the quantum regis- 
ter Hoo(A|£') = — logpa;^^^. If the there are multiple 
classical symbols with the same value Px^^^ , we take the 
smallest one in lexicographic order. We will discuss the 
choice of initial state in detail below. The ancilla system 
has total dimension — N and the other directions 

correspond to the classical symbols x. The experimenter 
implements a unitary U on Hcnc ^ Hanc during a speci- 
fied time t. At this point the ancilla system is passed to 
a referee who will decide whether information has been 
gathered successfully by measuring 'Hanc in the compu- 
tational basis, using measurement operators 



Px ■■= \x){x\ 



(11) 



where the subscript x denotes the corresponding classical 
output. Hence the success probability of correctly iden- 
tifying the state using this procedure when the ancilla 
was initially in the state |xmax) is given by 
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2 Apply unitary 



FIG. 1: Our protocol for distinguishing quantum states in 
finite time. First, the encoding register is placed into an en- 
coding px of the classical symbol x chosen with probabiHty 
Px- The ancilla is intialized in the state |xmax). Second, we 
can perform a unitary interaction U = exp{—iHt/h) for time 
t between the encoding and the ancilla register. Finally, the 
ancilla register is measured by the referee in the computa- 
tional basis to determine a guess x' for x. If x' — x, then we 
successfully recovered the classical information. In the setting 
of Bell inequalities considered later on, the ancilla register is 
simply the message returned to the referee. 



To bound how much min-entropy we have after time t, 
our goal is to place bounds on the success probability in 
terms of the unitary 



U = exp 



iHt\ 



that is, in terms of the interaction Hamiltonian 



H — ^ En\En) (E„ 



(13) 



(14) 



and the time t. Throughout, we will assume that H > 
and that the lowest energy level is in fact Eq ~ 0. 
Any other Hamiltonian differs from such an _ff by a term 
proportional to the identity, which does not contribute 
to the speed of information gain. We explictly chose not 
to use the common convention h = 1 to make it easier to 
draw comparisons to the Margolus-Levitin theorem [21] 
later on. 

Before turning to our actual bounds, let us first in- 
troduce some additional notation which we will refer to 
throughout the paper. We will use 



tr((I«_(»P,)C/(p,0|x„ax)(x„iax||)f/^) • (12) 



Px ■= Px 



)(x„ 



(15) 
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to denote the combined state consisting of the input state 
Px on the encoding space, and the initial state of the 
ancilla |x,„ax)(xmax|- We also write 

R:=U-I = ^(cxp(-^S„^/;^) - l)\En){E„\ . (16) 



Furthermore, it will be convenient to rewrite the success 
probability ( |12[ ) in terms of measurement operators 

:-C/t(I®P,)[/ = I®P, + W, , (17) 

as tr{AIxPx), where 

Wx = + , (18) 

and 

:= (I ® Px)R + ® Px) , (19) 

-.^ R^l(E) Px)R . (20) 

The average success probability for a particular Hamil- 
tonian H and time t can now be written as 



Psnc.s{X\E)H,t := ^P.tr(A/4p,) . (21) 



III. TIME VS. INFORMATION GAIN 

We are now ready to derive our bounds. For simplicity, 
we will outline how this can be done for the case of two 
equiprobable states, and merely state our general result. 
Precise statements as well as a detailed derivation can be 
found in the appendix. 



A. An upper bound to Pgucss{X\E) 

We now first derive an upper bound to the guessing 
probability. For the case of two equiprobable states (i.e., 
N = 2 and = 1/2 for all x e A", such bounds are easy 
to obtain when we allow unlimited time (or energy). In 
particular, it is well known that in this case the success 
probability is given by |2] 



-Pgucss(^|-£') :- 



D{pQ,Pi) 



(22) 



where D(poj Pi) — jIIpo — Pi||i is the trace distance of the 
two states. Let us now consider what happens in our time 
limited scenario for a particular interaction Hamiltonian 
H. First of all, recall that for two equiprobable states, 
the ancilla is initialized to the smallest value 



|0) 



(23) 



For two states, the success probability Pgucc averaged 
over the choice of input state, using the measurement 



given by operators Mi and Mq = I — Mi from (17), can 
now be expressed as 



succ {X\E)H,t 

= ^[tr(Mopo)+tr(Mipi)] 
= ^[l+tT{Mi(pi-po))] 

= ^ [1 + {Pl - Po) tr (-Pl|Xmax)(Xmax|) + 
tl (WiiPi - po))] 

^ 1 trjWiipi^po)) 
2 2 



(24) 
(25) 

(26) 
(27) 



where the fourth equality follows immediately from the 

fact that Pl|Xmax)(Xmax| = |Xinax) (Xmax | A = 0. Let US 

now upper bound the term involving Wi. Again using 

that Pl|Xi„ax)(Xmax| = |Xmax) (Xmax | A = 0, WC havC 



tr {Wlipi ~ Po)) ^ . 



(28) 



Define A := pi — po, and consider its diagonalization 
A ^ Ei-^i|wj)(wil- Let A+ := Ej,a,>o ^^d 
A~ := A — A'^ . Using the fact that i? • i?Ms a positive 
map "S^] and < I® P^; < I, we can now bound the term 



tr {wlA^ < tr 



(29) 



< 2Y,{^ - cos{tEn/h)){En\A+\En). (30) 



Substituting back into our original bound ( 27 ) gives us 



Psncc{X\E)H,t < 2 + - cos{tEn/h)){En\A+\En) . 

n 

(31) 

This is the basic inequality that we can use, along with 
some restriction on the allowed energies En, to bound 
the success probability for state discrimination in time 
t. In the rest of the paper we will apply this in two 
main settings, bounded maximum energy, and bounded 
average energy. 



A bound m terms of the maximum energy 



From (31 



success pro 



we can immediately obtain a bound on the 
)bability for state discrimination in terms of 
the maximum energy ||oo of the coupling Hamiltonian 
H. (||i?||oo is just the largest eigenvalue oi H.) This 
bound is attractive since it is simple to derive and has 
the appealing feature that it involves the trace distance 
between the two states, and is thus directly related to 
the probability that we distinguish the two states given 
an unlimited amount of time. However, there are many 
systems of physical interest where the maximum energy 
of system states is effectively unbounded. Even though 



5 



we may without loss of generality assume that the spec- 
trum is bounded for a particular set of input states (see 
appendix), this bound is nevertheless quite unsatisfying 
in these situations since it can be very weak. For this rea- 
son, we use the fundamental inequality (31) in the next 



section to derive a bound on the success probability that 
depends only on the average energy. 

Note that since tr(A+) = D{pQ,pi) — D{pQ,pi) we 
immediately obtain that the success probability obeys 



1 



(32) 



< 



(1 - COs(<C,nax/^))£'(P0, Pi) 



where Cmax = argniax£;^(l — cos{tEn/h)). If tEn/h < 1 
for all n, then this upper bound simply reads 



^ succ 
1 



(33) 



< 



{l-cos{t\\H\\^/h))D{po,pi) , 



which will be useful for comparison below. For larger 
values of tEn/h it is easy to see that 



where 



S/tt if 1< tEn/h < 4 , 
3/tt otherwise . 



(34) 



(35) 



2. A bound m terms of the average energy 

A sometimes more satisfying bound can be obtained 
in terms of the average energy. Note that we can upper 
bound (31 1 as 



1 7i 
2¥ 



Y,En{En\A+\En) , 



and hence we may use the fact that 

A+ = ^(A+ -A- 
2^ 



2^ 



A- 



1,. 

^{Pi - Po) + ^\Pi 



Pol 



to obtain 



Psucc{X\E)H,t 

1 , jt{tr{H\pi- po\)+tiiHpi)-tiiHpo)) 



(36) 

(37) 
(38) 

(39) 



< 



Ah 



Now, the asymmetry between the labels and 1 is 
inessential. The bound is true if we swap the two state 
labels, as may be seen by repeating the above derivation 
swapping the role of the two state labels. Averaging these 
two bounds we find the following symmetric bound 



PsucciX\E)H,t < 



1 7Ur(g|pi-/}o| 

2 Ah 



(40) 



This bound should be compared with the bound ( 33 ) in 



terms of the maximum energy in which the trace distance 
appears. The quantity on the right hand side of (|40|) is 



loosely an energy-weighted trace distance. Whereas this 
bound is certainly stronger for a particular choice of i?, it 
does not any longer bear an obvious quantitative relation 
to the Helstrom bound in terms of the trace distance. In 



deriving ( 40 1 we have made use of the knowledge of the 



optimal measurements for distinguishing a pair of states. 
This is no longer possible in more complicated cases, even 
where unlimited time is allowed [5]. We can weaken the 
bound somewhat, using the fact that p, H > to obtain a 
bound explicitly in terms of the average energy as follows 



< 



ICC 

1 , -fttT[H{po 



(41) 



Po) 



Ah 



So we see that the average energy of the joint system 
and ancilla place a bound on the success probability of 
state discrimination, as claimed. This bound may be 
generalized easily to the case of more than two classical 
symbols and an aribtrary distribution {px}x- We show 
in the appendix that 

Theorem III.l. Suppose H > 0. Then the probabil- 
ity of distinguishing po, . . . , Pn~i given with probabilities 
Po, . ■ . ,Pn-i using the Hamiltonian H obeys 



PsucciX\E)H,t 



(42) 



It 



N-1 



^ Px tr (Hpx) , 



a;=0 



whe 



7 



5/7r ifyEn,l < tEn/h < A , 
S/tt otherwise . 



(43) 



Note that the term J^xPx {HPx) is the energy of the 
encoding and ancilla register averaged over the choice of 
input symbols. 



B. A lower bound on Pguoas(-'f 

We now exhibit a specific measurement strategy for 
two equiprobable states, which attains our upper bound 
up to a constant factor. We again focus on the case 
of two possible input states, as for the general setting 
there is no analytic procedure of obtaining the optimal 
measurements even in the setting of unlimited time. Our 
construction for two states will make explicit use of this 
optimal measurement. 

Let A = pi — pq. It is well known |2] that the optimal 
distinguishing measurement in the time unlimited case 
without the use of an ancilla is given by {11^1+, n^j-}, 
where 11^+ and 11^- are projectors on the positive and 
negative eigenspace of A respectively. To construct our 
Hamiltonian H, let us diagonalize A — 
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and define := ^ 
Consider the operator 



jAj>0 



and A 



A- 



i + n^+®(|o)(i| + |i)(o|) . 



A+-A. 



(44) 



Clearly, H is Hermitian and unitary, and hence has eigen- 
values ±1. In fact, H is the unitary we would use to 
achieve the optimum distinguishing probability if we were 
unconcerned with time. We now define a Hamiltonian H 



(45) 



For comparison with our upper bound of ( 33 ) H obeys 
the condition H > and has largest eigenvalue equal 
to -Emax = ll^lloo- A simple calculation provided in the 
appendix shows that for our choice of H we have 



PsucciX\E)H,i 



1 



(1 - cos(i||iJ|| 



(46) 



which gives a lower bound to Psucc{X\E) maximized over 
all possible H in time t. This bound matches the upper 
bound of ( 33 ) up to a factor of 1/4. 

Note that H effectively implements a variant of the 
controUed-NOT (c-NOT) operation on the encoding 
space and the ancilla. For more than two inputs states, 
one could construct a similar H implementing a con- 
trolled addition mod N on the ancilla, as long as the 
optimum distinguishing measurement in the case of un- 
limited time is a projective measurement on the encoding 
space. This would give a similar relation between time 
and the original probability of distinguishing the given 
states. However, it is known that there do exist choices 
of encodings such that the optimum measurement is 
not projective, and hence we omit this restricted form of 
generalization. 



IV. APPLICATIONS 

Let us now consider several applications of our simple 
bound. 



A. Minimum distinguishing time and the 
Margolus-Levitin theorem 

The first application we are interested in, is a return 
to our initial question: Just how quickly can we acquire 
information? That is, what is the minimum time needed 
to extract classical information encoded in a quantum 
system? Note that with the Hamiltonian H in the lower 
bound for two equiprobale states, there does indeed exist 
a way to optimally distinguish the two states in time 
t = /i7r/||iJ||oo. However, since there is a small gap to our 
upper bound it would be an open question, whether it is 
possible to achieve the same in an even shorter amount 
of time. 



1. Minimum time 

Yet, note that our upper bounds on PsucciX\E)H.t can 
also be understood as lower bounds on the time required 
to optimally distinguish the given states, retrieving the 
maximum amount of information from the encoding. Let 
us first consider our most general bound for large X. 
We have that if we can distinguish optimally in time 
^distinguish our upper bound must be at least as large as 
the optimum Pguoss(^ !-£')• That is. 



T^distinguisli 



N-1 



ucss 

(47) 



x=0 



and hence 



(Pguoss(X|i?)-p.„.J^ . 
'■distinguish^ , /rr~ ^ ' 

7L^=0 PxtT{Hp^) 



2. Margolus-Levitin theorem 

Let us now consider the special case where two 
equiprobable input encodings are perfectly distinguish- 
able. That is, po = |0)(0| and pi — |1)(1|. Our task is 
now quite simple: We merely wish to turn the state |1) |0) 
of the encoding and ancilla system to the state 1 1) 1 1) , that 
is, we wish to transform one vector into its orthogonal. 
Note that given unlimited time (or energy) we can suc- 
ceed perfectly at this task and hence Pgucss(-^|£') = 1- 
From ( 48 1 we thus have 



^distinguish ^ 



27tr(i/pi) 



(49) 



Our bound can hence also be understood as putting a 
limit on the time that it takes to turn a state vector 
to its orthogonal (on the ancilla), given some additional 
resource (the encoding register). 

A bound on the minimum time that it takes to 
turn a vector into its orthogonal is indeed known as 
the Margolus-Levitin theorem [53]. In particular, their 
bound applied to our situation involving both the encod- 
ing and the ancilla register gives 



2tr {Hpi) 



(50) 



Such a bound had previously only been derived from the 
time-energy uncertainty principle where instead of the 
average energy, we have the energy spread, i.e, the differ- 
ence in the largest and smallest eigenvalue of the Hamil- 
tonian (see [33] for a review of history). The Margolus- 
Levitin theorem has been used to place bounds on the 
fundamental speed of computation [33], and was even 
slightly improved for some special cases [331. Note how- 



ever that for the Hamiltonian constructed in (451 we have 
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tr{Hpi) = £'i„ax/2 and hence the bound provided by 
Margolus-Levitin is in fact tight as we know that (45) 



lets us achieve the optimum success probabihty in time 
t = hiT I E-axavi- This shows that it is our upper, rather 
than our lower bound that can be improved. 

Since we have ^ — Z/ttot:^ — bjir depending on 
the parameters, our bound is slightly worse than the 
Margolus-Levitin bound which stems from our somewhat 
crude bound on (1 — cos{tEn/h)). Note, however, that 
our bound considers a more specialized situation, namely 
turning the ancilla to its orthogonal given the encoding, 
but in turn applies to any kind of input states. 

That we obtain a Margolus-Levitin type theorem as a 
side effect of our analysis is not very surprising: Clearly, 
the speed of dynamical evolution places a bound on how 
quickly we can transfer information from one system into 
the other. In turn however, note that a bound on how 
quickly transformation can be transferred does translate 
into bounds on the speed of evolution as well and one can 
think of the speed of dynamical evolution when applied 
to a computation |33j as being limited by how quickly 
one can transfer the necessary information required for 
the subsequent stage of computation. 



B. Time-dependent Tsirelson-bound 

As another example on how our bound can be used 
we will derive a time-dependent Tsirelson's bound |23| 
for the Bell inequality 21 known as the CHSH inequal- 
ity 



1. CHSH as a game 

We briefly describe the CHSH inequality in its more 
modern form as a game involving two distant players, Al- 
ice and Bob. A detailed account of this formulation and 
how it allows us to recover the original form of the CHSH 
inequality can for example be found in [35j . In the CHSH 
game, we imagine that we pose a question y G {0, 1} to 
Alice and a question z e {0, 1} to Bob, chosen uniformly 
at random, i.e., p{y) = p{z) = 1/2. These questions can 
be identified with the choice of measurement setting in 
the usual formulation. Alice and Bob now return answers 
a G {0, 1} and b G {0, 1} respectively, where we say that 
Alice and Bob win the game if and only if 



y ■ 



b mod 2 



(51) 



Alice and Bob may thereby agree on any strategy be- 
forehand, but they can no longer communicate once the 
game starts. In the quantum setting, this strategy cor- 
responds to a choice of shared state and measurements, 
and in an experiment the no-signaling assumption is em- 
ployed to enforce their inability to communicate. Clearly, 
one may write the probability that Alice and Bob win for 



a particular strategy as 

Pwin = j ^ ^ Pi[a,b\y,z] , 



(52) 



y,ze{0,l} a,b 

a-\-h—y-z 



where Pr[a, b\y, z] denotes the probability that Alice and 
Bob return answers a and b given questions y and z. For 
any classical strategy, p^in < 3/4 but quantumly there 
exist a strategy that achieves pwin = 1/2-1- 1/(2^/2) « 
0.853. This is in fact optimal, since Tsirelson has 
shown [231 136] that for any quantum strategy 



< 



1 



1 

2V2 



(53) 



2. Strategies and state discrimination 

For our purposes, it will be convenient to employ a 
simple observation about what Bob has to do in order 
to produce the right answer in the game, which was de- 
scribed in more detail in |35] . Let py a denote the state 
of Bob's system conditioned on the fact that Alice re- 
ceived question y and has given answer a. Note that 
Bob's system will be placed in this state with probability 
p{y, a) = p{a\y)/2. For z = 0, (51 1 the rules of the game 
state that Alice and Bob win if and only if Bob returns 
the same answer as Alice, that is, b = a. In other words. 
Bob would like to determine, which of the following two 
states he is given 



/ 2=0,0 
[% Po,o 



2=0,0 



,2=0 



2=0,1 , 2=0,1 , 

9o Po,i + qi piA , 



where 



gr°'"=P(0|2/)/(p(0|0) 
grO-i=p(l|y)/(p(l|0). 



-Pirn) 
-pm) 



(54) 
(55) 



(56) 
(57) 



and the probability of cr^"" is given by p^^ — (p(x|0) -I- 
p{x\l))/2. That is. Bob would simply try to extract clas- 
sical information stored in quantum states, which is ex- 
actly the setting that our bound applies to. Producing 
a classical outcome on the ancilla system is very natural 
in this setting as we can imagine that when giving his 
answer Bob simply returns his ancilla to a referee who 
decides whether Alice and Bob win [3S]. Similarly, if 
z = \ Bob would like to determine which of the following 
two states he is given 



2=1 / 2=1.0 , 2=1,0 \ 

o-o — \% Po,o + qi Pis j , 

2=1 / 2=1,1 

0-1 — [qo po,i 



(58) 



ql ^'Vi,o) , (59) 
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where 



'Z^=o'°=P(0|0)/b(0|0) 

'z^=!'°=p(i|i)/(Mo|o) 
Ci''-p(o|i)/(Mi|o) 



the probabiHty of (Tq~^ is Pq~^ 
and the probabihty of crf^^ is pf ^ 



-P(0|1)) , 
-P(0|1)) , 

(p(0|0) - 
.(p(l|0)- 



(60) 
(61) 
(62) 
(63) 

Ml|l))/2, 
-M0|l))/2. 



The probabihty that Ahce and Bob win the game for a 
particular strategy can now be expressed as 



= 1 E Psucss{X^\En , 



(64) 



ze{oa} 



where we write Pguoss(-'^^|£'^) for Bob's success probabil- 
ity in solving the state discrimination problems described 
above for z G {0,1}. From this perspective, Tsirelson's 
bound provides us with an upper bound on how well we 
can solve these two problems on average. 



3. A time limited game 

In the usual setting of this game, Alice and Bob are 
essentially given an unlimited amount of time and en- 
ergy to produce their answers. But how well can they do 
given only a limited amount of energy and time? Here, 
we consider a time-limited version of the CHSH game, 
in which Alice and Bob are given a fixed time t to pro- 
duce their answers. If no answers are given at time t, we 
automatically rule that Alice and Bob loose. Our goal 
will be to derive a time-dependent version of ( 53 ) . For 



simplicity, we will thereby assume that Alice has an es- 
sentially unlimited amount of energy at her disposal and 
only Bob will be restricted in some fashion. Given the 
perspective that Bob has to solve a state discrimination 
problem to produce the right answer as explained above, 
it is clear that we can use our general bound to address 
this setting. The use of an ancilla register is very natural, 
as we can view it as the message system holding Bob's 
answer that is returned to the referee. 

In the usual scenario, Alice and Bob can choose which 
state to share at the start of the game as part of their 
strategy. Note, however, that we cannot allow arbitrary 
starting states to begin with, as we want to put a limit on 
the energy that Bob has at his disposal. For simplicity, 
however, we will make the sole assumption that Bob's 
Hamiltonian is bounded as |j_ff|joo. In the appendix, we 
will derive a general time dependent Tsirelson bound 
from this assumption where we will need our general- 
ization of the time bound for two input states to the case 
of non-uniform input distributions. 

Here, we will focus on the essential idea that underlies 
this bound which already becomes apparent if we con- 
sider a slightly simpler scenario in which Alice's marginal 
distributions are uniform {p{a\y) = 1/2 for all y). This 



scenario is well motivated if we imagine that there is a 
source supplying Alice and Bob with the maximally en- 
tangled state which lies outside of their control, and their 
strategy is restricted to their choice of two-outcome ob- 
scrvables. In this case, Alice's outcome distribution will 
either be deterministic or uniform. In the deterministic 
case, Alice essentially plays a classical strategy. To ob- 
tain a quantum advantage in the case of unlimited time, 
Alice's outcome distributions will be uniform, and we will 
hence focus on this case. 

To obtain a time-dependent Tsirelson bound, we now 
employ our simple bound involving the original trace dis- 
tance of the two states that we wish to discriminate ( 34 1 . 
We have by Tsirelson's bound that 



J2 PguessiX^lE') < 

:e{o,i} 



1 



1 



2^2 



and hence by ([22]) and the fact that p{a\y) — 1/2 



(65) 



(66) 



otherwise there would exist a better strategy for Alice 
and Bob at long times. So we have from (34) that 



< 



1 



7^l|g||oc 



(67) 



In particular, this means that if we allow only a lim- 
ited amount of energy by Bob (e.g., by demanding that 
1 1 I loo = 1), then Bob needs time at least 



t > 



l\\H\ 



(68) 



to achieve the optimum quantum violation of CHSH. 
Note that to achieve the optimum quantum violation, 
Alice's marginals will in fact be uniform, and hence this 
is indeed the minimum time required. 

Clearly, for small time frames, it would be better for 
Alice and Bob to play a classical strategy in which Bob 
can just return the ancilla |0) "as is" to the referee. The 
tradeoff betweeen the classical and quantum strategies in 
our setting can be captured when considering arbitrary 
distributions, which we will address in the appendix. In 
particular, we will show that 

Corollary IV. 1. Let Bob's Hamiltonian be scaled such 
that H > 0. Then the maximum success probability of 
winning the CHSH game for Alice and Bob in time t 
obeys 



Pv^in 



3 

< - 



7t||7J|j 



where 



7 := 



S/tt if 1 < tEn/h < 4 
3/7r otherwise . 



(69) 



(70) 
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We could also derive a more general bound in terms of 



Bob's average energy using (40). However, such a bound 
does not compare easily to the original Tsirelon's bound. 



Of course, the minimum time ( 68 1 is extremely small, 



and irrelevant for any practical tests of CHSH. Indeed, 
it is not our intention to question the validity of present 
CHSH experiments or suggest any loopholes caused by 
an insufficient distance for Alice and Bob compared to 
the time it takes them to achieve Tsirelson's bound. In- 
stead, we provided the present analysis as an illustrative 
example of how our bound applies. 

We would like to point out that (|69]) tells us that the 
strength of non-local correlation is indeed a function of 
time. Furthermore, (68 1 tells us that there exists a funda- 



mental time required to establish maximally strong quan- 
tum correlations. Finally, we note that one can also in- 
terpret (69 1 in another way: Let's suppose that we were 



to fix a time t and observe that Alice and Bob tend to 
win the game with probability at least q. We can now 
rewrite (69) to obtain a lower bound on ||_ff||oo. That is 



we can conclude that Bob had a certain energy at his dis- 
posal, and the strength of non-local correlations in this 
setting provides us with a form of "energy witness" for 
Bob. This also holds for the most general case discussed 
in the appendix. 



V. DISCUSSION 
A. Choice of initial state 

We obtained a series of simple bounds on how well we 
can recover classical information stored in a quantum sys- 
tem within a certain timeframe. Let us now first consider 
what role the choice of initial state of the ancilla played in 
our bounds. During our discussions we assumed that the 
ancilla started out in the classical state corresponding to 
the most likely symbol a;„iax- This reflects the fact that 
the distinguisher does have full knowledge not only about 
the states themselves, but also about the distribution 
Px- In particular, this means that without touching the 
quantum register, he can always achieve a success proba- 
bility ofpx„-,^^ by outputting a;max- Clearly, we could have 
chosen any other classical symbol as our starting point, 
and our bounds can easily be adapted accordingly. This 
holds even for an arbitrary pure state of the ancilla. Yet, 
such a choice does not reflect the distinguisher's apriori 
knowledge. 

Another option would be to let the ancilla start out in 
a special blank state, which intuitively corresponds to an 
outcome of "don't know" . and is orthogonal to any other 
outputs. It is straightforward to apply our methods to 
obtain a similar bound for this case. Yet, note that using 
a blank ancilla state is conceptually rather different since 
it means that we essentially neglect the apriori knowledge 
that a distinguisher has available. 



B. Input size 

Our bound is especially useful, if we are merely 
concerned with the probability of success that can be 
achieved withing a certain time t in principle, using any 
physically allowed operation H. This is indeed interest- 
ing when we consider the problem of Bell inequalities 
where we wanted to obtain a bound on how well Alice 
and Bob can violated CHSH within a given time frame, 
when they can choose any Hamiltonian they like subject 
to energy constraints alone. In particular, we would like 
to emphasize that the time required to acquire classical 
information in our setting is not limited by the size of 
the alphabet X, but merely by the choice of encodings. 
In practise, however, there are much more stringent con- 
straints on how quickly information can be transferred 
that depend on the geometry of the ancilla, leading to 
additional constraints on the interaction Hamiltonian H. 
For example, it could be that H can consist only of two 
qubit interactions, and interactions between the encod- 
ing system and the ancilla are limited to their bound- 
ary. In this case, the size of the alphabet X clearly 
does matter, and stronger bounds therefore should de- 
pend strongly on the exact form of H. We note that 
some bounds on time scales for particular Hamiltonians 
H do follow from the decoherence and thermodynam- 
ics literature [211 [29] for pure state encodings, yet since 
such bounds typically involve ||^fint||oo, where Hint is the 
interacting part of H they offer little advantage in our 
setting. To see how such bounds are related to ours is 
most easily seen when considering the conditional von 
Neumann entropy ll{X\E). Note that if all Px are pure 
the overall cqq-state pxEA 40J is pure as well. Hence, 
ll{X\E) = U{XE)-}i{E) = H(yl)-H(i;). To determine 
how ll{X\E) can change with time we would thus like to 
determine how the entropy of the reduced systems A and 
E evolves with time which has been studied for the von 
Neumann entropy in the decoherence literature where an 
upper bound for the rate of change in entropy was ob- 
tained in terms of H^^intlloo |28j . Similar considerations 
can be made for other entropies |37) . It is an interesting 
open question to obtain good bounds on such quantities 
for arbitrary H that take more of their structure into 
account. 



C. Open questions 

Clearly, this is not the only interesting open question. 
Closely related is the question of how much time is re- 
quired to demonstrate non-local correlations if Alice and 
Bob are yet more restricted. Again, this could take the 
form of physical constraints on the ancilla, or be consid- 
ered in the framework of circuit complexity where one 
cares about the number of two qubit interactions, i.e., 
gates, that they have to apply. The example of CHSH is 
too small for such constraints to make a difference, but 
do play an important role when considering more com- 
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plicated inequalities. 

Furthermore, it would be nice to see if the slight gap 
between our bound and the Margolus-Levitin theorem 
can be closed completely using a more stringent analysis 
for the case of orthogonal encodings px- In particular, 
this means that one would rederive the exact form of the 
Margolus-Levitin theorem from the rate of information 
transfer alone. 
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Appendix A: Basic observations 

In this appendix, we provide the technical details of 
our claims. To this end, we first establish two simple 
lemmas from which we later derive all our results. Since 
we will use these in everything that follows we consider 
the generalized problem where we wish to distinguish 
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N states po, ■ ■ ■ ,pn-i- The first lemma will be used to 
bound the success probabilies using measurement opera- 
tors = l(g)P^ + Wx where the label x S {0, . . . , N- 1} 
corresponds to one of the N states we wish to idenitfy. 

Lemma A.l. For any Hermitian operator A g yB('Hin) 
with diagonalization A = X]j '^''^'^ '^'"^V ^ ^ 
{0, . . . ,-/V— 1} the operator A :— A® |x„iax) (xmaxi satisfies 

tr (W^,i) < 2^(1 ~ cos{tE.nin)){E,,\A.,\En) , (Al) 



where 
Ax ■= 



\A\ $5 |x,iiax) (XmaxI for X — a;niax 7 

A+ (g) |x,nax)(xmax| Otherwise , 



A+ := Xj\uj){uj 

i,Aj>0 



(A2) 
(A3) 



Proof. Using the definition of Wx from ( 18 ) we evaluate 
the terms involving and separately. Let us now 
first bound the term involving W^. For x ^ a^max we 
have that 

tr (W^A) 

= tr ((I ® Px)RA^ + tr (i?^^ (I ® P^)i) (A4) 
- , (A5) 



where we used the linearity and cyclicity of the trace, as 

well as the fact that -RE|Xinax)(Xmax| = |Xmax)(Xmax|P!c = 

for all X ^ Xinax- Let A' := Y,j^x^^o\uj){uj\, and 
define A+ := A+ <^ |x,nax) (xmaxi and A^ := A^ <Si 
|xmax)(xmax|- Note that A ^ A+ - A^ . For x = Xmax we 
can now use the fact that 



R = U-I^ J2i(iM~^tE„/h) - l)\En){E„ 



to write 
tr 

= tr 
= tr 



({1 (g) Px^^^jRA^ + tr (^R^{1 ® Px^^JA 

OO 

Y{exp{~itEn/h) + cxp{itEn/h) - 2) 

71=0 

{En\A\E„) 

OO 

2Y,icos{tE,Jh) ~ l){E„\A\En) . 



(A6) 

(A7) 
(A8) 

(A9) 
(AlO) 



where the fourth equality follows from Euler's formula, 
and the first inequality from the fact that cos{tEn/h) — 
1 < and A+,A- > 0. 

It remains to bound the term involving W^- First of 
all, since A{X) — RXK) is a positive map [32 , and 
A+ ,A^ > 0, we have that 



RA+R^ > , 

rA-r^ > . 



(A13) 
(A14) 



Note that for any X,Z > 0, we have tr(AZ) > 0, 
and hence tr (^{1(bPx)rA-r^^ > 0. Second, note that 
RR^ = and we have 

RR^ = 



= ^ (2 - cxp{itEn/h) - cxp{-itEn/h)) 



ji=0 



\En){En\ 

2^(1 -cos(i£;„/fi)) 



(A15) 



(A16) 
(A17) 



where the second equality follows by applying Euler's for- 
mula. We thus have 

tr (W^A) (A18) 

= tr (^(I ® - tr (^(I ® 

< tr (^R^rA+^ (A19) 

= 2^(1 - cositE,Jh)){E^\A+\E,,) (A20) 



where the first inequality follows from (A13), the fact 



that < 1(g) < I and the cyclicity of the trace, and the 
last equality from (A15). Putting everything together. 



tr{WxA) = tr{Wl^A) + ti-{W^A), we obtain the claimed 
result. □ 

We will also make repeated use of the following bound. 
Note that whereas the bound applies to a very large range 
of values En > 0, we will later be particularly interested 
in the case of tEn/h < 1. Indeed the bound below is 
a great overestimate if tEn/h > 27r, as 2(1 — cos(fc)) < 
7(fc- 27r[fc/(27r)J). 

Lemma A.2. Let En > 0. Then 2(1 - cos{tEn/h)) < 
jtEn/h where 



S/vr i/1 < tEn/h < 4 
' ' 3/n otherwise . 



(A21) 



= 2j2icOs{tEn/h) ~ l){En\{A+ ~ A-)\En 

OO 

< 2^2^^ - COs{tEn/h)){En\A-\En) , 
n=0 



(^All) Appendix B: A bound for two states 

We now first consider the case were we are given just 
(A12) two states, po and pi. Here, we will consider the most 
general problem where po and pi can be arbitrary. 
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1. A bound in terms of the trace distance 

First of all, note that even for a general distribution 
{Px}x the problem of distinguishing two states is easy 
to analyze P|. In particular, we have that in the time- 
unlimited case for measurement operators acting directly 
on the encoding space 

Pg^^ss{X\E) = max po tr(Mopo) + Pi tr(Afipi) (Bl) 



Mo, Ml 

= Po + maxtr {Mi{pipi ~ poPo)) 

All 

= Po + A(pipi,poPo) , 
where A(pipi,poPo) is given by 

MpiPi,PoPo) = 

= tr(A+) 



max tr (PA) 

0<P<I 



(B2) 
(B3) 

(B4) 
(B5) 



where A :— pipi — popo with diagonalization A — 
X;^. Aj|uj)(uj| and A+ = J2j,Xj>o\'^j)(^3\ (Note that A 
is not symmetric here and hence formally does not form 
a distance measure.) Similarly, we have 



A(poPo,PiPi) = tr(A-) . 



(B6) 



Note that for the time unlimited case, we could have 
equivalently expressed the success probability as 



Pgucss{X\E) =pi + A{pqPq,piPi) 



(B7) 



It will also be useful to note that for — Px 

l^max) (^max 1 1 



A(piPi,PoPo) = A(pipi,popo) • 



(B8) 



Before stating our bound, let us introduce some addi- 
tional notation. For two states, define 



:= 1 - Xn 



(B9) 



We now first relate the problem of discriminating the two 
states in time t to the original success probability. 

Lemma B.l. The probability of distinguishing po ^.i^-d Pi 
given with probabilities po and pi using the Hamiltonian 
H = '^n-^n\En){En\ > is bounded by 



PsucciX \E)H,t < P:r„,..+ 

2(1 - C0s(iCniax/^))A(p3;,^i„p^^j„,P:r„ 



(BIO) 



where Cr. 



argmax^ (1 — cos(ti?„//i)). 



Proof. Using ( Bl ), ( 17 ) and the fact that Px,„i„ |xniax) = 



we may bound the success probability as 



Ps ucc {X\E)H^t < P.„_ + 
tr(W^.„.„(p 



(Bll) 



Applying Lemma [A.l| for A 

have that 



tr(W":,.„^,„(^(g) |x„iax)(Xmax|)) 

< 2^(i-cos(i£;„/;i))(£;„|A+^|x„ 



(B12) 

■c) (^max| |-^n) ■ 



Hence from ( B5 1 and ( B8 1 we have 



tr(W^,_(p )) (B13) 

< 2(1 - COs(tCmax/^))A(p2;,^;„Pj;,^;„,Pa;,^^^p^,^^^) . 



Our claim now follows by plugging this bound into (Bill 



□ 



A.2| one may now also use 



With the help of Lemma 
the fact that Vi?„,i?„ < |oo to obtain a very simple 
bound in terms of the spectrum of the Hamiltonian. 

Corollary B.2. The probability of distinguishing po and 
pi given with probabilities po and pi using the Hamilto- 
nian H ~ X^ri -^"1^") (^"1 ^ bounded by 

Psucc{X\E)H.t < P.„_+ (B14) 
jt\\H Woo A{Px,„i,,Px,„i„ 



2. A bound in terms of the average energy 



Inspecting the proof above with Lemma A. 2 in mind, 
it is indeed easy to see that we can also obtain a bound 
in terms of average energies. We first derive a somewhat 
stronger bound for two equiprobable states that actually 
depends on the "average energy" of a function of both 
states. 

Theorem B.3. The probability of distinguishing po and 
pi given with probabilities po and pi using the Hamilto- 
nian H — J2n ^n\En) {En\ > in time t is bounded by 



Ps ucc {X\E)H^t<Px^^^ + 

it 

— [tr(i/|p^„^j„Pj;„i„ -Px„axPx„ax 

Px^i^ tr(i7/5^„,j„) - Pj;,^_^^ tr(i7p^„ 



(B15) 



where 



rj/niflK tEn/h < 4 
' ' S/vr otherwise . 



(B16) 



Proof. Recall that by applying Lemma |A.1| with A ~ 

PXrai-aPxrair, ~ Px^^^^Px^^^ WC haVC that 

)) (B17) 

< 2^(1 - COs{tEn/h)){En\A+ (g, |Xniax>(Xmax||^^«) ■ 
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We may now use Lemma A. 2 to obtain 



(W^:r„,i„ (Px„i„/0:r„i„ - Px„,^^Px,^^J) (B18) 



7itr(i/A+) 



Our claim now follows by noting that 



= ^ (^A+ + A-j+^{A+-A- 



1 



1 



(B20) 
(B21) 

□ 



Appendix C: A bound for many input states 

Finally, we derive a bound for the most general case of 
distinguishing states poi • • ■ , Pn-i where we are given 
with probability Px ■ 

Theorem C.l. Suppose H > 0. Then the probabil- 
ity of distinguishing po, . . . , pn-i given with probabilities 
Po,...,Pn-i obeys 



N-l 



PsncciX\E)H^t < P.„_ + jrY.P-^'' ^^'P-) ' (CI) 

where 



x=0 



7 



5/7r ifyEnA < tEn/h < 4 
3/tt otherwise 



(C2) 



Proof. Note that the success probability for a particular 
interaction H is now given by 



Psncc{X\E)H,t - J2 P^^'^i^'I^f 



(C3) 



a;=0 



where 



tr (MxPx) = tr ((I (g, Px)px) + tr {W^Px) ■ (C4) 

Let us now first consider the case of a; = Xmax- We have 
that 

tr((I®P,)p,) =tr(p,) 1 . (C5) 
Using Lemma |A.1| with A — px "we hence have that 

tr(M,p,)< (C6) 

i + 2^(i-cos(i£;„/;i))(£;„|p,| En) ■ 



We now turn to the case oi x ^ aJmax- Since 

-^a; l^max) (Xinax I — l^niax) (^niax | -^a; — and Px — Px ® 
|Xmax)(Xmax| We ha vC {l ®Px)px = for all X ^ Xinax- 

Again by applying |A.1| with A = px we obtain from 
Px>^ that 



(B19) ^^{WxPx) < 2^(1 - cos{tEn/h)){En\px\En) ■ (C7) 



Our claim now follows by using Lemma |A.2| to obtain 
2(1 - cos{tEn/h)) < -ftEn/h for En > 0. □ 



Appendix D: Attaining the bound 

We now exhibit a Hamiltonian that achieves our upper 
bound for the success probability of distinguishing two 
states given with apriori equal probability. 

Theorem D.l. Suppose we are given po and pi with 
apriori equal probability. Let -Emax > 0. Then there ex- 
ist a Hamiltonian H with ||-ff||co = ^-max tho,t in time t 
achieves success probability 

PsucciX\E)H^t = 1 + 1(1 _ cos{t\\H\\^/h))D{po, Pi) . 

(Dl) 

In particular, we can distinguish the two states perfectly 
in time t — h-K /E-c^ay.. 

Proof. Let A = pi — po. We can diagonalize A = 
Aj|uj)(wj |, and define A+ := Ej,a,>o and 
A^ -.^ A — y4+. Consider the operator 



H -.= 11^- «)i + n^+ ® (|o)(i| + |i)(o|) 



(D2) 



where and 11^- are projectors on the support of 

yl+ and A^ respectively. Clearly, H is Hermitian and 
unitary, and hence has eigenvalues ±1. We now define 
the Hamiltonian H 



H ■.^E^a^{H + I)/2 



(D3) 



Since any term in the Hamiltonian proportional to the 
identity does not affect the dynamics, we may replace 
the time evolution operator exp(— with 



oo ^ 

U = exp{^it\\H\\^H/2h) - y - 



2h 



For our choice of H we have that 

{Hr - 



11 n even 
H n odd . 



(D4) 



(D5) 



and as a result the Taylor expansion for U gives 

U = {cos{t\\H\\^/2h)I -ism{t\\H\\^/2h)H) . (D6) 
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Using this unitary in our state discrimination problem, 
we obtain 



Psucc{X\E)H.t 

1 1 



(D7) 



tr(C/t(I®Pi)t/(pi-/5o)) 



2 2 

1 , sm''{t\\H\\^/2h) 



tr 



(iJ(l0Pi)iJ(pi-po) 



(D8) 



It remains to evaluate the last term. First of all, note 
that 



H{l(E) Pi)H = (n 



A+ 



+ n^- . (D9) 



Since pi - po = {A+ - A-) (g) |0)(0| and H^+H^- = 
11^1-11^+ — 0, we thus have 



tr 



(^H{I ® Pi)H{pi - po)) = tv{A+) (DIO) 
= Dipo,Pi). (Dll) 



The claim follows by an application of the double angle 
formula. □ 



Appendix E: Constraining the eigenvalues of H 

For completeness, we now remind ourselves why in 
many settings it is not unreasonable to assume that 
||i?||oo is indeed bounded. Note that when dealing with 
fixed input states po, . . . ,pn~i, we can without loss of 
generality assume that the Hamiltonian H that leads to 
the optimal success probability possible within a certain 
time t is limited to the energy eigenspace sufficient to 
contain the support of the inputs states and the stan- 
dard ancilla state. This holds even in an approximate 
sense. To see this, consider how the state on the encod- 
ing register and the ancilla evolve in time 



Po(t) U{t)poU{t)^ , 
pi(t) ■.= U{t)hU{t)^ . 



(El) 
(E2) 



Choose an error parameter e and define n| to be the 
lowest rank operator such that [n|,i/] — and 



|po(o)-n|po(o)nf||i <e 
|pi(o)-n|pi(o)nf||i <e 



(E3) 
(E4) 



Since [n|,i/] = and the Ll-norm is unitarily invari- 



ant, we can immediately conclude that (E3) still holds 



when we replace po(0) and pi(0) with any subsequent 
states /9o(t) and pi{t). However, this tells us that we can 
approximate U with a unitary U as 



U{t) := eM-iHt/h) 



(E5) 
(E6) 



without affecting any of the output states, except with a 
chosen error e. By the definition of the Ll-norm we have 
for any Hermitian operator A that 



A||i = sup tr(Fyl) 

-i<p<i 



(E7) 



and hence ( E3 ) implies that using the unitary H in place 



of any original H leads to a change in success probability 
in the state discrimination problem of at most 2e. 



Appendix F: A general time-dependent Tsirelson's 
bound 



Let us now consider a more general version of our time- 
dependent Tsirelson's bound in which we drop the as- 
sumption that the source emits a particular state, and 
that Alice makes a two-outcome projective measurement. 
The only assumption we will make now is that Bob's 
Hamiltonian is bounded ||i?||oo = -E-max- 

For our proof, we will need the more general version 
of the two state discrimination problem in which the two 
states are not necessarily given with equal probabilities 
(see Corollary[R2|. A gain, let us first briefly consider the 
time unlimited case, where Mq and Mi are just measure- 
ments on a single system. Recall that we could express 
the success probability of distinguishing po and pi given 
with probabilities po and pi respectively as 



-Pguoss(-'^I^) =P0 + A(piPi,PoPo) 



(Fl) 



At first glance, this expression appears a bit assymetric 
- after all, what should be so special about pq7 Note, 
however, that by replacing Mi = I — Mq in (Bl) we 



could also have expressed the success probability as 

-Pgucss(-'^|£^) ^Pl + A{poPo,pipi) . 

In particular, it will be convenient to note that we could 
have also written the success probability as the average 
of these two terms 



P., 



1 



1+ ^ A{psPx,Px 



(F2) 



xe{o,i} 



Let us now return to the time limited case, involving an 
interaction of the encoding and ancilla system, followed 
by a measurement on the ancilla. Recall that we have 
from Corollary |B . 2| that 



(F3) 



MPx^i„Px^i„,Px 



.Px„ 



Note that in the time limited case we cannot simply aver- 
age - the proof of Corollary |B.2| yields a different bound 
had we placed Px„i„ in front (a small calculation shows 
that it will again single out Px^^^ ) ■ We are now ready to 
show our general bound, where we will use the notation 



developed in Section IV B 2 
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Lemma F.l. Let Bob's Hamiltonian be scaled such that 
H > 0. Then the maximum success probability of winning 
the CHSH game for Alice and Bob in time t obeys 



where we have used the shorthand 



P ■ < - 



V^e{o,i} / 



-ft\\H\\ 



V2h 



wher 



5/n ifl<tE,Jh<A 
^ ' ' S/tt otherwise . 



(F4) 



(F5) 



Proof. We have from Tsirelson's bound '53] that for any 
strategy of Ahce and Bob no matter how much time or 
energy they may have available 



Pwin 



ze{oa} 

z \ xe{o,i} J 



< 



1 



2 ' 2V2 ■ 
Rearranging terms gives us 



z,a;G{0,l} 



1 

2^2 



Since A(-, •) > 0, this means that 



(F6) 

(F7) 
(F8) 

(F9) 
(FIO) 

(Fll) 



Our claim for the time limited case now follows by plug- 
ging this bound into (IFSI) 



-Pwin 



(F12) 



2e{o,i} 




(F13) 



(F14) 



C 



tl\\H\\ 



(F15) 



□ 

At first glance, this bound may seem somewhat strange 
as it involves a potentially unknown term ^^Px^^^- 
Note, however, that this term is determined by the dis- 
tributions over the states that Bob should distinguish. It 
is this distribution, that determines the classical bound 
for CHSH and hence 



< 



(F16) 



Note that since the ancilla is initialized to the most likely 
X in each case, there exists a strategy for Alice and Bob 
with which they can play optimally classically in no time 
at all. Yet, since there is generally an interplay between 
the choice of distributions and the state Alice can create 
we derived the bound in its more general form above. 



Corollary F.2. Let Bob's Hamiltonian be scaled such 
that H > 0. Then the maximum success probability of 
winning the CHSH game for Alice and Bob in time t 
obeys 



< 



lt\\H\U 

V2h 



(F17) 



where 



r 5/7r ifl < tEn/h < 4 
' 1 S/tt otherwise . 



(F18) 



